Language-Agnostic Event Detection Across Sports from Twitter Using Temporal Features

نویسنده

  • Cody Buntain
چکیده

A significant body of work focuses on detecting events in social media, much of which relies on restrictive assumptions like prior event-centric filtering or known language models. Rather than build these expensive pre-processing and filtering pipelines, we explore language-agnostic techniques using only temporal features like token frequency. We construct temporal features for tokens in an unfiltered Twitter stream, build a classifier to recognize tokens experiencing bursts in usage, and relate those bursts back to known occurrences within sports. Our results demonstrate preliminary feasibility in rapidly detecting tokens that correspond to highimpact occurrences within sporting events. We also successfully transfer a model trained on American football and hockey data to previously unseen types of sports like the 2014 World Cup and the 2014 Kentucky Derby/Belmont Stakes horse races. This model has less success at identifying events in regular-season Major League Baseball, so more research is required to enhance these capabilities.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Model for Detecting of Persian Rumors based on the Analysis of Contextual Features in the Content of Social Networks

The rumor is a collective attempt to interpret a vague but attractive situation by using the power of words. Therefore, identifying the rumor language can be helpful in identifying it. The previous research has focused more on the contextual information to reply tweets and less on the content features of the original rumor to address the rumor detection problem. Most of the studies have been in...

متن کامل

Learning to Discover Key Moments in Social Media Streams

This paper introduces LABurst, a general technique for identifying key moments, or moments of high impact, in social media streams without the need for domain-specific information or seed keywords. We leverage machine learning to model temporal patterns around bursts in Twitter’s unfiltered public sample stream and build a classifier to identify tokens experiencing these bursts. We show LABurst...

متن کامل

A Survey of Techniques for Event Detection in Twitter

Twitter is among the fastest-growing microblogging and online social networking services. Messages posted on Twitter (tweets) have been reporting everything from daily life stories to the latest local and global news and events. Monitoring and analyzing this rich and continuous user-generated content can yield unprecedentedly valuable information, enabling users and organizations to acquire act...

متن کامل

Using Twitter to Detect and Tag Important Events in Sports Media

In this paper we examine the effectiveness of using a filtered stream of tweets from Twitter to automatically identify events of interest within the video of live sports transmissions. We show that using just the volume of tweets generated at any moment of a game actually provides a very accurate means of event detection, as well as an automatic method for tagging events with representative wor...

متن کامل

High-Order Sequence Modeling for Language Learner Error Detection

We address the problem of detecting English language learner errors by using a discriminative high-order sequence model. Unlike most work in error-detection, this method is agnostic as to specific error types, thus potentially allowing for higher recall across different error types. The approach integrates features from many sources into the error-detection model, ranging from language model-ba...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014